An efficient strategy for mining exceptions in multi-databases

نویسندگان

  • Shichao Zhang
  • Chengqi Zhang
  • Jeffrey Xu Yu
چکیده

This paper proposes a new strategy, referred to as local instance analysis, for multidatabase mining. While many interstate organizations have an imperative need to analyze their data in multi-databases distributed throughout their branches, traditional multi-database mining utilizes the strategies for mono-database mining: pooling all the data from relevant databases into a single dataset for discovery. This leads to the destruction of useful information, for instance, 70% of branches within a company agreed that a married customer usually has at least 2 cars if his/her age is between 45 and 65’. This information assists in global decision-making within the company. Our new strategy is developed for discovering this useful information. Using the local instance analysis, we design an algorithm for identifying exceptions from multi-databases. Exceptional pattern reflects the individuality’ of, say, branches of an interstate company. 2003 Published by Elsevier Inc. * Corresponding author. Address: Faculty of Information Technology, University of Technology, Sydney, P.O. Box 123,Broadway, NSW 2007, Australia. Fax: +61-2-9514-1807. E-mail addresses: [email protected] (S. Zhang), [email protected] (C. Zhang), [email protected] (J.X. Yu). 0020-0255/$ see front matter 2003 Published by Elsevier Inc. doi:10.1016/j.ins.2003.10.008 2 S. Zhang et al. / Information Sciences xxx (2003) xxx–xxx ARTICLE IN PRESS

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Identifying Global Exceptional Patterns in Multi-database Mining

In multi-database mining, there can be many local patterns (frequent itemsets or association rules) in each database. At the end of multi-database mining, it is necessary to analyze these local patterns to gain global patterns, when putting all the data from the databases into a single dataset can destroy important information that reflect the distribution of global patterns. This paper develop...

متن کامل

Fuzzy multi-criteria selection procedures in choosing data source

Technology assessment and selection has a substantial impact on organizations procedures in regards to technology transfer. Technological decisions are usually made by a group of experts, and whereby integrity of these viewpoints to a single decision can be quite complex. Today, operational databases and data warehouses exist to manage and organize data with specific features and henceforth, th...

متن کامل

Multi-Output Adaptive Neuro-Fuzzy Inference System for Prediction of Dissolved Metal Levels in Acid Rock Drainage: a Case Study

Pyrite oxidation, Acid Rock Drainage (ARD) generation, and associated release and transport of toxic metals are a major environmental concern for the mining industry. Estimation of the metal loading in ARD is a major task in developing an appropriate remediation strategy. In this study, an expert system, the Multi-Output Adaptive Neuro-Fuzzy Inference System (MANFIS), was used for estimation of...

متن کامل

Mining the Banking Customer Behavior Using Clustering and Association Rules Methods

  The unprecedented growth of competition in the banking technology has raised the importance of retaining current customers and acquires new customers so that is important analyzing Customer behavior, which is base on bank databases. Analyzing bank databases for analyzing customer behavior is difficult since bank databases are multi-dimensional, comprised of monthly account records and daily t...

متن کامل

Database classification for multi-database mining

Many large organizations have multiple databases distributed in different branches, and therefore multi-database mining is an important task for data mining. To reduce the search cost in the data from all databases, we need to identify which databases are most likely relevant to a data mining application. This is referred to as database selection. For real-world applications, database selection...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Inf. Sci.

دوره 165  شماره 

صفحات  -

تاریخ انتشار 2004